Implement a new --failing-and-slow-first command line argument to test runner. #24624

juj · 2025-06-26T16:36:13Z

This keeps track of the results of the previous test run. On subsequent runs, failing tests are run first, then skipped tests, and finally successful tests in slowest-first order. This improves the parallel throughput of the suite.
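The ordering described above can be sketched as a single sort key. This is a minimal illustration, not the code in this PR: the `PreviousResult` record, the `order_tests` helper, and the choice to run tests with no history alongside the failures are all assumptions. The sketch also applies slowest-first within every group, whereas the description only states it for successful tests.

```python
from dataclasses import dataclass


@dataclass
class PreviousResult:
    # Hypothetical record of one test from the previous run.
    outcome: str     # "failed", "skipped", or "passed"
    duration: float  # wall-clock seconds in the previous run


# Lower rank runs earlier: failures, then skips, then passes.
OUTCOME_RANK = {"failed": 0, "skipped": 1, "passed": 2}


def order_tests(test_names, previous):
    """Sort tests failing-first, then skipped, then passed, slowest-first in each group."""
    def sort_key(name):
        result = previous.get(name)
        if result is None:
            # Assumption: tests without history are scheduled with the failures.
            return (0, 0.0)
        # Negative duration makes longer tests sort earlier within a group.
        return (OUTCOME_RANK[result.outcome], -result.duration)

    return sorted(test_names, key=sort_key)


previous = {
    "test_a": PreviousResult("passed", 1.0),
    "test_b": PreviousResult("failed", 0.5),
    "test_c": PreviousResult("passed", 9.0),
    "test_d": PreviousResult("skipped", 0.0),
}
print(order_tests(["test_a", "test_b", "test_c", "test_d"], previous))
# prints ['test_b', 'test_d', 'test_c', 'test_a']
```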

Add support for --failfast in the multithreaded test suite, so that suite runs stop quickly at the first test failure.
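One way to picture fail-fast behavior in a parallel run is sketched below. It uses `concurrent.futures.ThreadPoolExecutor` purely as a stand-in; the test runner's actual parallel machinery is not shown in this conversation, and `run_failfast` is a hypothetical helper.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def run_failfast(tests, workers=4):
    """Run tests in parallel and stop scheduling new ones after the first failure.

    `tests` maps a test name to a zero-argument callable that raises on failure.
    Returns (passed_names, failed_name_or_None).
    """
    passed = []
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = {pool.submit(fn): name for name, fn in tests.items()}
        for future in as_completed(futures):
            name = futures[future]
            if future.exception() is not None:
                # Cancel everything that has not started yet.
                # Tests that are already running are allowed to finish.
                for other in futures:
                    other.cancel()
                return passed, name
            passed.append(name)
    return passed, None


def ok():
    pass


def bad():
    raise AssertionError("boom")


passed, failed = run_failfast({"test_ok": ok, "test_bad": bad}, workers=2)
print(failed)  # prints test_bad
```

Combined with running previously failing tests first, the first failure tends to surface near the start of the run, which is what makes the early exit cheap.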

Together, the --failfast and --failing-and-slow-first flags can help achieve test suite runs of under 10 seconds on a CI when the suite is failing.

Example core0 runtime with test/runner core0 on a 16-core/32-thread system:

Total core time: 2818.016s. Wallclock time: 118.083s. Parallelization: 23.86x.

Same suite runtime with test/runner --failing-and-slow-first core0:

Total core time: 2940.180s. Wallclock time: 94.027s. Parallelization: 31.27x.

This yields better throughput and reduces test suite wall time by 20.37%.
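The figures above can be reproduced from the two reported lines. The helper names below are illustrative only; the inputs are the numbers quoted in this description.

```python
def parallelization(core_time, wall_time):
    """Average number of cores kept busy: total core time divided by wall time."""
    return core_time / wall_time


def relative_change_percent(before, after):
    """Signed percentage change from `before` to `after`."""
    return (after - before) / before * 100.0


baseline = parallelization(2818.016, 118.083)
reordered = parallelization(2940.180, 94.027)
wall_change = relative_change_percent(118.083, 94.027)

print(f"{baseline:.2f}x")     # prints 23.86x
print(f"{reordered:.2f}x")    # prints 31.27x
print(f"{wall_change:.2f}%")  # prints -20.37%
```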

sbc100

IIUC this is what I currently use --failfast --continue for. The downside of --failfast --continue of course is that it doesn't work for parallel testing (so I also add -j1).

.gitignore

sbc100

Actually maybe I misunderstood. I use --failfast --continue when implementing new features and wanting to fix each test failure as I run into it.

How does this improve CI times on the bots? It seems like it would not affect the first run, but only subsequent runs, which the bots don't do, do they?

juj · 2025-08-13T14:30:29Z

> How does this improve CI times on the bots? It seems like it would not affect the first run, but only subsequent runs, which the bots don't do, do they?

It doesn't work on the current CircleCI bots, which always start from a clean slate and run all suites from a single command invocation, but it does help when a developer runs test suites locally, and on the ad hoc CI I am running at http://clbri.com:8010/.

For example, in one such run all the failing suites fail within a few seconds, rather than taking a random length of time to fail.

Passing suites also run faster: because the shortest tests are run last, core utilization will be 100% throughout the test suite run. It acts as a self-calibrating alternative to naming slow tests test_zzz_ (which is detrimental to test speed).
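The utilization argument can be illustrated with a toy simulation of greedy scheduling, where each test is handed to whichever worker becomes free first. The durations and the `makespan` helper are made up for illustration and are not measurements from the suite.

```python
import heapq


def makespan(durations, workers):
    """Wall time when tests are handed out, in order, to the first free worker."""
    free_at = [0] * workers  # time at which each worker becomes idle
    heapq.heapify(free_at)
    for duration in durations:
        start = heapq.heappop(free_at)  # earliest available worker
        heapq.heappush(free_at, start + duration)
    return max(free_at)


# One slow test and eight fast ones, on two workers.
durations = [8, 1, 1, 1, 1, 1, 1, 1, 1]

print(makespan(sorted(durations, reverse=True), 2))  # slowest first: prints 8
print(makespan(sorted(durations), 2))                # shortest first: prints 12
```

With the slow test started first, the fast tests fill the other worker while it runs. Started last, the slow test runs alone while the other worker sits idle.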

juj added 6 commits June 26, 2025 19:33

ruff (1008e6d)

ruff (28e4ab8)

Improve test run information to flow across suites when running in --failfast mode. (a8544b2)

Refactor num_failures into a fail_frequency to get a more normalized failure result. (78aa3fb)

ruff (1fd0b64)

sbc100 reviewed Jul 7, 2025

sbc100 reviewed Jul 7, 2025

Save previous test run file to out/ (b3cec56)
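A rough sketch of how results might be persisted between runs follows. Everything here is an assumption made for illustration: the JSON layout, the helper names, and in particular the `fail_frequency` update rule (averaging the old value with the newest outcome so it stays between 0 and 1). The PR's actual file format and formula are not shown in this conversation.

```python
import json
import os


def load_previous_results(path):
    """Return the stored results, or an empty dict on a first run."""
    if not os.path.exists(path):
        return {}
    with open(path) as f:
        return json.load(f)


def record_result(results, name, outcome, duration):
    """Update one test's entry in place after it has run."""
    entry = results.get(name, {"fail_frequency": 0.0})
    failed = 1.0 if outcome == "failed" else 0.0
    # Assumed rule: recent outcomes weigh more, and the value stays in [0, 1].
    entry["fail_frequency"] = (entry["fail_frequency"] + failed) / 2
    entry["outcome"] = outcome
    entry["duration"] = duration
    results[name] = entry


def save_results(path, results):
    """Write the results file, creating its directory if needed."""
    os.makedirs(os.path.dirname(path) or ".", exist_ok=True)
    with open(path, "w") as f:
        json.dump(results, f, indent=2)
```

A normalized frequency of this kind, unlike a raw failure count, does not keep growing with the number of runs, so a test that failed long ago gradually loses priority.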
